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A  goal  of  the  Air  Weather  Service  Is  to  be  able  to  state  the 
probability  that  a  weather  element  will  have  a  value  above  a  specified 
threshold  for  any  location  at  any  time.  Many  models  have  been  devel¬ 
oped  for  modeling  such  weather  elements  as  visibility,  ceiling,  sky 
cover,  precipitation,  windspeed,  etc.,  for  locations  where  records 
exist.  The  modeling  for  locations  where  there  are  no  records  is  a  more 
difficult  task.  This  report  develops  and  evaluates  some  new  models  for 
estimating  visibility  probabilities  for  German  locations  where  no 
records  exist.  The  models  require  only  knowledge  of  the  elevation  of 
the  location  and  the  average  of  the  elevations  a  distance  of  20  kilo¬ 
meters  from  the  location  in  order  to  estimate  visibility  probabilities 
for  a  specified  month  and  hour  period.  The  next  section  outlines  the 
work-to-date  by  the  present  authors  on  the  problem  of  predicting 
visibilities  at  locations  where  no  records  exist. 

2.  Background  of  Previous  Work  on  ''Spreading11  Visibility 

At  the  University  of  Central  Florida,  under  the  sponsorship  of  the 
Air  Force  Geophysical  Laboratory,  several  technical  reports  have  been 
prepared  on  the  subject  of  estimating  visibility  in  data  void  regions, 
with  particular  reference  to  Germany.  In  AFGl-TR-81-0144,  "Some  Models 
for  Visibility  for  German  Stations,"  the  Weibull  distribution  was  used 
to  fit  visibility  data  for  30  stations  in  Germany.  For  each  of  the  30 
stations,  values  of  o  and  (S  were  obtained  for  each  month  and  for  all 
3-hour  periods.  For  any  distance  x,  the  probability  F(x)  of  visibility 
less  than  a  distance  of  x  miles  is  then  given  by  the  formula 


Measures  of  the  accuracies  of  the  predictions  were  also  obtained. 
These,  however,  were  based  on  the  residuals  of  the  least  squares  fits; 
and,  as  such,  the  standard  deviations  would  be  expected  to  be  under¬ 
estimates.  A^o,  no  attempt  was  made  to  predict  visibility  for 
locations  where  no  records  existed. 
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The  first  report  in  which  an  attempt  was  made  to  predict  visi¬ 
bility  for  locations  where  no  records  existed  was  AFGL-TR-81-0313, 
"Modeling  Visibility  for  Locations  in  Germany  Where  No  Records  Exist." 
In  this  report,  the  models  were  formulated  by  using  the  values  for  a 
and  &  from  AFGL-TR-81-0144  and  regressing  these  values  on  a  large 
number  of  variables  which  were  potential  predictors  of  the  values  for  a 
and  fi.  These  included  such  variables  as  station  elevation,  relative 
elevation,  proximity  to  water,  mean  windspeed  and  various  combinations 
(interactions)  of  these  and  other  variables.  The  prediction  equations 
for  o  and  &  were  then  refined  by  a  sophisticated  nonlinear  regression 
program  in  which  the  sums  of  squares  of  the  residuals  minimized  were 
not  the  residual  differences  between  the  values  for  o  and  &  and  their 
predicted  values  but  were  the  differences  between  the  empirical 
cumulative  distribution  function  and  the  model  cumulative  distribution 
function.  This  latter  regression  produced  a  set  of  equations  for 
predicting  a  and  &  from  geographic  and  climatic  predictor  variables. 
One  set  of  equations  was  produced  for  each  month  and  time  of  day. 
These  models  were  designated  collectively  as  the  "Variables  Model." 

A  second  set  of  models  was  produced  by  again  using  the  above 
mentioned  nonlinear  regression  program;  this  time  using  none  of  the 
predictor  variables.  The  result  was  a  pair  of  constants  a,  3  for  each 
month  and  time  of  day  which  gave  the  best  fits  for  all  stations.  These 
models  were  designated  collectively  as  the  "Constants  Model." 

The  measures  of  accuracy  of  the  models  used  in  this  report  were 
the  resulting  "mean  square  of  the  residuals."  Since,  again,  this  is 
essentially  an  internal  estimate  and  is  based  on  the  same  data  used  to 
produce  the  models,  the  RMS  values,  when  interpreted  as  accuracy 
measures  are  optimistic  estimates. 

The  third  paper  In  the  series,  AFGl-TR-82-0187,  "Evaluation  of  an 
Observation-based  Climatology  Model  for  Predicting  Visibility  for  Data 
Void  Locations  in  Germany,"  gives  two  independent  evaluations  of  the 
"Constants"  model  developed  in  the  previous  report.  Two  methods  were 
used.  The  first  was  a  traditional  method.  The  original  30  stations 
were  regarded  as  a  "calibration"  set.  A  second  set  of  30  stations  was 
used  as  an  "evaluation"  set.  The  second  method  involved  a  relatively 
new  technique  called  "sample  reuse"  (also  sometime*  called  cross- 


validation),  whose  use  has  only  recently  been  made  practical  due  to  the 
availability  of  "cheap"  computing. 

The  "sample  reuse"  method  evaluations  were  entirely  consistent 
with  the  evaluations  based  on  "calibration"  and  "evaluation"  sets,  and 
the  report  may  be  said  to  have  confirmed  the  use  of  "sample  reuse." 
Values  of  a  and  &  for  the  "Constants"  model  were  recalculated  making 
use  of  all  60  stations. 

The  report  AFGL-TR-82-0239,  "Estimating  a  Family  of  Distributions 
with  Applications  to  Climate  Spreading,"  was  an  attempt  to  make  avail¬ 
able  to  a  wider  audience  some  methodologies  developed  in  previous 
reports  and,  in  particular,  the  work  reported  in  AFGL-TR-81-0144.  The 
report  is  an  excellent  example  where  the  solution  of  a  practical 
problem  resulted  in  the  development  of  new  methodologies  and  basic 
research  applicable  to  a  much  wider  class  of  problems.  Publication  of 
this  research  in  the  scientific  literature  is  expected. 

The  present  report  was  first  intended  to  be  an  independent  evalua¬ 
tion  of  the  "Variables"  model  of  AFGL-TR-81-0144.  However,  since  that 
model  included  as  predictors  for  a  and  3  variables  which  would 
ordinarily  not  be  available  for  a  station  where  no  records  existed,  and 
since  AFGL-TR-82-0187  demonstrated  the  ability  of  "sample  reuse"  to 
produce  a  valid  independent  evaluation  of  those  models,  it  was  decided 
to  proceed  otherwise. 

Models  were  developed  for  each  month  and  hour  period  (except  hour 
periods  00-02  and  03-05)  using  only  the  variables  elevation  and  rela¬ 
tive  elevation,  whose  values  would  ordinarily  be  available  at  locations 
with  no  previous  records  for  visibility,  and  which  had  previously  been 
confirmed  to  be  useful  for  prediction  purposes.  Sample  reuse  was  used 
to  produce  independent  estimates  of  the  accuracy  of  the  models. 

3.  Methodology 

Let  E.(x.)  be  the  empirical  probability  (step  function)  that  the 

visibility  is  less  than  x .  miles  for  the  jth  station,  using  data  from 
the  RUSSWOs.  The  values  for  x..  are  the  following  distances  in  miles: 
1/4,  5/16,  1/2,  5/8,  3/4,  1,  5/4,  3/2,  2,  5/2,  3,  4,  5  and  6.  Let 


where 


Fj(x;  “ 


+  y. 


+  6, 


* 

* 


EL.  +  Jfp  *  AE. 

J  C  J 

ELj  ♦  62  *  AEj. 


EL.  and  AE.  are  transformed  and  scaled  values  for  the  elevation  at 

J  J 

station  j,  and  the  average  elevation  at  20  equispaced  locations  on  a 

circle  of  radius  20  kilometers  and  centered  at  the  station.  EL  is  the 

g 

cube  of  the  elevation  in  feet,  divided  by  10  .  AE  is  the  cube  of  the 

g 

average  elevation  In  feet,  divided  by  10  .  The  constants  Yg,  Yj, 

6q,  5j,  5 ^  are  determined  by  minimizing  the  expression 


X  X  C Ej ( x  1 )  -  E j ( x  1 ;  ay  Bj)]2 

J  * 

That  Is,  the  constants  are  those  which  minimize  the  sum  of  squares  of 
the  differences  between  the  empirical  and  model  probabilities  over  all 
stations  and  all  distance  for  which  data  are  available.  The  constants 
Yq,  Yj  and  are  given  in  Exhibit  2.1,  "Model  Coefficients  for  Alpha," 
and  the  constants  6g,  5j,  62  are  given  In  Exhibit  2.2,  "Model 
Coefficients  for  Beta." 

The  models  were  evaluated  by  sample  reuse.  The  method  of  sample 
reuse  may  be  briefly  described  as  follows.  If  there  are  n  stations,  n 
separate  models  are  developed.  For  each  model,  one  station  is  used  as 
the  evaluation  set,  with  the  remaining  n-1  used  as  the  calibration  set. 
For  each  station,  the  root  mean  square  (RMS)  error  of  the  modeling 
procedure  is  obtained.  The  RMS  values  for  the  Individual  stations  are 
then  pooled  to  obtain  an  overall  RMS  error  for  the  procedure.  The  RMS 
values  by  month  and  hour  are  given  in  Exhibit  2.3. 

Exhibit  2.4  gives  the  RMS  for  each  station,  averaged  over  all 
months  and  hour  periods.  The  first  column  of  RMS  values  gives  the  RMS 
values  from  sample  reuse,  that  is,  when  all  of  the  other  stations  were 
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used  in  the  model  building  process,  and  the  specified  station  was  used 
as  the  "evaluation  set."  The  RMS  values  indicate  how  well  that 
visibility  would  have  been  predicted  if  the  station  had  been  a  station 
"for  which  no  records  exist,"  using  a  model  based  on  all  the  other 
stations.  For  54  out  of  the  60  stations,  prediction  would  have  been 
good.  The  stations  which  would  not  have  been  well  predicted  are 

Baumholder,  Siegenberg,  Kahler  Asten,  Plezen,  Feldberg  and  Grosser 
Falk.  The  latter  six  stations  have  previously  presented  modeling 

problems.  The  second  column  of  RMS  values  are  the  RMS  values  from  a 
full  regression  for  the  model  using  all  60  stations.  Two  things  ar 
remarkable.  First,  Feldberg  and  Grosser  Falk  show  up  as  remarkab' 
influential  in  the  regression.  This  suggests  that  for  both  station 
there  is  a  prediction  variable  that,  if  used  in  the  model,  would  ha\ 
greatly  improved  prediction  for  these  stations.  Second,  for  the  othe 
58  stations  the  RMS  values  are  little  changed.  This  suggests  (at  least 
to  the  authors)  that  the  use  of  the  variables  of  elevation  and  relative 
elevation  results  in  a  quite  good  model  for  predicting  visibility 
probabilities  for  specific  months  and  hour  periods. 

Exhibit  2.5  gives  the  RMS  values  that  resulted  when  all  60 
stations  were  used  to  generate  the  model  coefficients  given  in  Exhibits 
2.1  and  2.2.  These  are  the  RMS  values  over  all  stations  and  are 
internal  estimates.  Exhibit  2.3,  and  not  Exhibit  2.5,  should  be  used 
as  a  measure  of  the  ability  to  predict  visibility  for  locations  in 
Germany  where  no  visibility  records  are  available.  One  reason  for 
retaining  Exhibit  2.5  in  this  report  is  to  indicate  that  RMS  values 
obtained  from  the  regressions  producing  the  model  are  overestimates  of 
the  model  accuracies  and  instead  an  independent  estimate  should  be 
used.  It  is  worth  repeating  that  sample  reuse  does  provide  independent 
estimates  of  the  model  accuracies!  This  is  accomplished  despite  the 
fact  that  we  do  not  have  two  sets  of  data — one  for  the  determination  of 
the  model  accuracies  and  one  for  the  evaluation  of  the  model. 

4.  Use  of  the  Model 

Problem  1:  Find  o  and  f!  for  Koblenz  for  March  0900-1100  hours  and 
use  it  to  find  probability  of  visibility  less  than  1.5,  2.0  and  6.0 
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miles.  The  elevation  of  Koblenz  Is  318  feet  and  the  average  elevation 
of  the  surrounding  area  Is  900  feet. 

Solution:  From  Exhibits  2.1  and  2.2  we  have 
a  =  .0848  +  .00580  (318)3/109  -  .00178  (900)3/109  =  .0837  and 
8  =  1.26  -  .00983  (318)3/109  -  .0105  (900)3/109  =  1.252. 

Estimated  probability  of  visibility  less  than  or  equal  to  x  miles  Is 

.  -.0837X1-252 

l  -  e 

The  probabilities  of  visibilities  less  than  or  equal  to  distances  of 
1.5,  2.0  and  6.0  miles  are  thus  estimated  as  .130,  .181  and  .546. 
These  compare  to  the  RUSSW0  values  of  .090,  .156  and  .525. 

Problem  2:  Find  a  and  8  for  Berus  for  September  1800-2000  hours 
and  use  It  to  find  probabilities  of  visibility  less  than  1.5,  2.0  and 
6.0  miles.  The  elevation  of  Berus  Is  1204  feet  and  the  average  eleva¬ 
tion  of  the  surrounding  area  is  1050  feet. 

Solution:  From  Exhibits  2.1  and  2.2  we  have 
a  =  .0174  +  .00305  x  (1204)3/109  -  .000689  (1050)3/109  =  .0219  and 
8  =  1.40  -  .0111  x  (1204)3/109  -  .00670  (1050)3/109  =  1.373. 
Estimated  probability  of  visibility  less  than  or  equal  to  x  miles  is 

.  *.0219x1,373 

1  -  e 

The  probabilities  of  visibilities  less  than  or  equal  to  distances  of 
1.5,  2.0  and  6.0  miles  are  thus  estimated  as  .037,  .055  and  .226. 
These  compare  to  the  RUSSW0  values  of  .019,  .027  an-1  .140. 

5.  Summary 

Given  only  month,  time  of  day,  and  the  elevation  and  surrounding 
elevations  for  an  arbitrary  location  in  Germany,  models  are  developed 
for  estimating  probabilities  of  visibilities  less  than  a  specified 
distance.  The  two-parameter  Wei  bull  model  is  used.  For  a  given  month 


and  hour  period,  the  parameters  are  functions  of  elevation  and  average 
elevation  of  the  surrounding  region.  The  models  are  developed  using  a 
somewhat  sophisticated  nonlinear,  least  squares  regression  program. 
Evaluation  is  accomplished  using  "sample-reuse, "  a  relatively  new 
technique  in  which  a  valid  independent  evaluation  of  the  model  is  made 
using  the  same  observations  used  to  develop  the  model. 
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Exhibit  2.2 

Model  Coefficients  For  8 


Permit  fu'i 


10 


WMO 

Station 

Lat 

(North) 

10  6  16 

HAHN  A* 

49.95 

10610 

BITBURG  A6 

49.95 

10616 

RAHSTE  IN  A E* 

49.43 

10607 

SPANGDAHLEM  Aft 

49.97 

10384 

TENPELHOF  APR! 

52.47 

10755 

ANSBACH  AAF 

49.32 

10544 

FULDA  AAF 

50.53 

10869 

ERDING  AS 

48.32 

10765 

FEUCHT  AAF 

49.38 

10618 

BAUMHOLDER  AAF 

49.65 

10626 

BAD  KREUZNACH  AAF 

49.87 

10971 

BAD  TOLZ  AAF 

47.77 

10714 

ZUEIBRIJCKFN  AB 

49.22 

10633 

WIESBADEN  AB 

50.05 

10633 

FINTHEN  AAF 

49.97 

10763 

FURTH  AAF 

49.50 

10^42 

MANAU  AAF 

50.17 

10857 

GABLINGEN  AAF 

48.45 

10653 

GIEBELST ADT  AUX  af 

49.67 

10687 

GRAFENUOHR  AAF 

49.70 

10734 

HE  I  DEI  BERG  AAF 

49.40 

10752 

ILLESHEIH  AAF 

49 .47 

10659 

KITZINGF.N  AAF 

49.75 

10763 

NUPNBERG 

49.50 

10729 

COLEMAN  AAF 

49.57 

10657 

UERTHEIM  AAF 

49.77 

10745 

SCHUAEBISCH  HALL  AA 

49.17 

10712 

SEMBACH  AB 

49.52 

10367 

S IEGENBERG  GUNNERY 

48.75 

10738 

ECHTERDINGEN  ARPT 

48.68 

10501 

AACHEN.  DL 

50.78 

10224 

BREMEN.  GER 

53.05 

10305 

LINGEN.  GER 

52.52 

10427 

KAHLER  6STEN .  GER 

51.18 

11157 

aigen  fhnstal,  OS 

47.53 

1  1446 

PLEZEN/OQBRA.  CZ 

49.67 

10900 

BREMGARTEN.  GER 

47.90 

10948 

OBERSTDORF,  GER 

47.40 

10929 

KONSTANZ.  GER 

47.68 

11120 

INNSBRUCK.  OS 

47.27 

11150 

SALZBURG.  OS 

47.80 

10893 

PASSAU.  GEP 

48 . 58 

10908 

FELDBERO.  OEK 

47.37 

i  ov  :•  i 

NEUHAUScN.  DL 

47.98 

10510 

NliRBUR  j »  GER 

50 . 33 

10515 

KOBLENZ  >  DL 

50.35 

10532 

G I  ESSE  N .  GER 

50.57 

10542 

HERSFELL .  DL 

50 .87 

10658 

KISSINuE.n.  DL 

50.20 

106.1 

COBURG.  GER 

50.27 

10635 

HOF.  GER 

50.3  2 

10704 

BERUS.  GFf: 

49 . 27 

10727 

KARLSRUHE.  GER 

49.02 

10742 

OHRINGEN.  GER 

49.20 

10788 

STAU8ING.  GER 

48.87 

10791 

GROSSER  FALK.  GER 

49.08 

10805 

LAHR.  GER 

48.37 

10837 

LAUDHEIM.  GER 

48.22 

10953 

KAUFBEUREN.  DL 

47.87 

10375 

KUHLDORF.  GER 

48.25 

Long 

Rel 

Sample 

Re-use 

Full 

Fit 

(East) 

Elev. 

Elev. 

RMS 

RMS 

7.27 

1650. 

1267. 

.062 

.048 

6.57 

1228  . 

1363. 

.044 

.042 

7.58 

..780-* 

•  7  VO. 

.051 

.052 

6.70 

1196. 

1  330. 

.057 

.035 

13.40 

164. 

164. 

.058 

.038 

10.63 

1  542  . 

1181. 

.068 

.064 

V .  63 

1010. 

1650  < 

.045 

.046 

11.93 

1522  . 

1525. 

.  064 

.062 

11.18 

1265. 

1407. 

.046 

.045 

7 . 30 

1408 . 

1538. 

.153 

.  154 

7.88 

355. 

825. 

.071 

.071 

11.60 

2260. 

2688. 

.  121  . 

.  102 

7 .40 

1132. 

938. 

.04  1 

.040 

8.33 

470. 

617. 

.036 

.036 

8.15 

769. 

948. 

.048 

.047 

10.95 

1000. 

1225. 

.085 

.085 

8. 95 

377. 

408 . 

.037 

.036 

10.87 

1530. 

1  550 . 

.084 

.082 

9.88 

985 . 

930. 

.060 

.060 

11 .9S 

3  370  . 

1510. 

.063 

.066 

8.65 

369. 

369. 

.034 

.034 

10.38 

1060  . 

1250. 

.  084 

.084 

10.20 

699  . 

899  . 

.051 

.052 

11.08 

1053. 

1053. 

.062 

.063 

8.47 

334. 

350. 

.078 

.077 

9.48 

1120. 

1075. 

.095 

.096 

V  .  78 

1  303  . 

16  32. 

.046 

.046 

7.87 

1052. 

1315. 

.037 

.037 

11.80 

1325. 

1450. 

.140 

.140 

9.22 

1306. 

1200. 

.056 

.054 

6.12 

673. 

775. 

.081 

.080 

8.80 

502. 

250. 

.043 

.043 

7.33 

549  . 

PSO. 

.040 

.040 

8.50 

2822  . 

1563. 

.204 

.188 

14.15 

2139. 

3350. 

.056 

.053 

13.30 

1194  . 

1400. 

.130 

.131 

7.63 

699  . 

1438. 

.044 

.041 

10.J0 

2664  . 

3350. 

.097 

.080 

9.20 

1368. 

1725. 

.037 

.036 

11.37 

1962. 

3400. 

.069 

.074 

13.02 

1  463  . 

2338. 

.048 

.047 

13.50 

1335. 

1613. 

.056 

.055 

8.02 

4898  . 

2550. 

.280 

.069 

8.92 

2648. 

2463. 

.072 

.063 

6 .97 

2064  . 

1 125. 

.085 

.0  77 

7.60 

318. 

900. 

.066 

.068 

8 . 72 

640, 

825 . 

.059 

.060 

9.72 

7  38. 

1  200. 

.059 

.060 

10.10 

704  . 

1213. 

.039 

.040 

10.97 

1106. 

1338. 

.038 

.039 

11.90 

1864  . 

1938. 

.048 

.057 

6. 70 

120  4. 

1  050. 

.  056 

.054 

8 . 40 

394  . 

600  . 

.041 

.041 

9.53 

90y  . 

1075. 

.049 

.049 

12.60 

1  155 . 

1350. 

.084 

-.085 

13.30 

4291  . 

2475. 

.259 

.064 

7.85 

509. 

800. 

.074 

.073 

9  .  y3 

1  765. 

1938. 

.053 

.056 

10.63 

23H8  . 

2513. 

.069 

.062 

12.55 

1319. 

1625. 

.053 

.053 

Exhinit  2.^ 
RMS  By  Station 
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Exhibit  2.5 

RMS  Bv  Month  and  Hour  (Overall  Fit) 
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Copy  available  to  Dl’IC  uoes  not 

permit  fully  legible  reproduction. 


